Novel Lossy Compression Algorithms with Stacked Autoencoders
نویسندگان
چکیده
Lossy compression is a strategy to reduce the size of data while maintaining the majority of its useful or meaningful information. Though some data is lost in the compression process, lossy compression algorithms try to remove data that is unlikely to be considered important by a human consumer. Given this goal, today’s lossy compression techniques are carefully designed to model human perception as closely as possible in order to maximize the amount of valuable retained information for a given data compression level. The popular JPEG image compression algorithm, for example, is reliant upon the observation that human visual perception is far more sensitive to low-frequency spatial variation than high-frequency spatial variation. Similarly, the MP3 audio compression algorithm uses carefully-tuned psychoacoustic models to make inferences about which components of a given audio stream are most perceived by a human listener. These perceptual coding methods exhibit quite impressive compression results, but require a great deal of work among psychologists, computer scientists, and engineers, and are ineffective or even detrimental when applied in situations for which they were not designed. Using JPEG to compress line art, for example, often yields undesirable results. It is thus valuable to be able to easily develop a compression method that is naturally tuned to a particular type of data. This can be accomplished by extracting statistical regularities in the data and then using this information to compress the data. We demonstrate that one can efficiently compress data through the careful application of deep-belief networks while maintaining the data’s most meaningful characteristics.
منابع مشابه
Lossy Image Compression with Compressive Autoencoders
We propose a new approach to the problem of optimizing autoencoders for lossy image compression. New media formats, changing hardware technology, as well as diverse requirements and content types create a need for compression algorithms which are more flexible than existing codecs. Autoencoders have the potential to address this need, but are difficult to optimize directly due to the inherent n...
متن کاملفشردهسازی تصویر با کمک حذف و کدگذاری هوشمندانه اطلاعات تصویر و بازسازی آن با استفاده از الگوریتم های ترمیم تصویر
Compression can be done by lossy or lossless methods. The lossy methods have been used more widely than the lossless compression. Although, many methods for image compression have been proposed yet, the methods using intelligent skipping proper to the visual models has not been considered in the literature. Image inpainting refers to the application of sophisticated algorithms to replace lost o...
متن کاملTraining Stacked Denoising Autoencoders for Representation Learning
We implement stacked denoising autoencoders, a class of neural networks that are capable of learning powerful representations of high dimensional data. We describe stochastic gradient descent for unsupervised training of autoencoders, as well as a novel genetic algorithm based approach that makes use of gradient information. We analyze the performance of both optimization algorithms and also th...
متن کاملMarginalized Stacked Denoising Autoencoders
Stacked Denoising Autoencoders (SDAs) [4] have been used successfully in many learning scenarios and application domains. In short, denoising autoencoders (DAs) train one-layer neural networks to reconstruct input data from partial random corruption. The denoisers are then stacked into deep learning architectures where the weights are fine-tuned with back-propagation. Alternatively, the outputs...
متن کاملStacked Autoencoders for Medical Image Search
Medical images can be a valuable resource for reliable information to support medical diagnosis. However, the large volume of medical images makes it challenging to retrieve relevant information given a particular scenario. To solve this challenge, content-based image retrieval (CBIR) attempts to characterize images (or image regions) with invariant content information in order to facilitate im...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009